Stereo Source Separation and Source Counting with MAP Estimation with Dirichlet Prior Considering Spatial Aliasing Problem

Authors

  • Shoko Araki
  • Tomohiro Nakatani
  • Hiroshi Sawada
  • Shoji Makino
Abstract

In this paper, we propose a novel sparse source separation method that can estimate the number of sources and the time-frequency masks simultaneously, even when the spatial aliasing problem exists. Recently, many sparse source separation approaches with time-frequency masks have been proposed. However, most of these approaches require the number of sources to be known in advance. In our proposed method, we model the phase difference of arrival (PDOA) between microphones with a Gaussian mixture model (GMM) with a Dirichlet prior. We then estimate the model parameters by maximum a posteriori (MAP) estimation based on the EM algorithm. To avoid one cluster being modeled by two or more Gaussians, we use a sparse distribution modeled by the Dirichlet distribution as the prior of the GMM mixture weights. Moreover, to handle wide microphone spacings where the spatial aliasing problem occurs, the 2πk indeterminacy of the phase is also included in our model. Experimental results show good performance of our proposed method.
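
The idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it fits a one-dimensional GMM to wrapped phase values from a single frequency bin, whereas the paper models the PDOA across frequencies. The function names (`wrap`, `map_weights`, `fit`), the hyperparameter `phi`, the truncation `k_max` of the 2πk sum, the variance floor, and the clipping of negative weight numerators to zero are all assumptions made for illustration.

```python
import math
import random

TWO_PI = 2.0 * math.pi

def wrap(x):
    """Map an angle to the principal interval [-pi, pi)."""
    return (x + math.pi) % TWO_PI - math.pi

def map_weights(counts, phi):
    """MAP mixture weights under a symmetric Dirichlet(phi) prior.

    With phi < 1 the numerator N_m + phi - 1 can become negative;
    clipping it at zero prunes that component (assumed heuristic).
    """
    raw = [max(0.0, n + phi - 1.0) for n in counts]
    total = sum(raw)
    return [r / total for r in raw]

def fit(phases, n_comp=6, phi=0.1, n_iter=50, k_max=1, var_floor=1e-2):
    """EM with a MAP weight update for a GMM on wrapped phases."""
    means = [wrap(-math.pi + TWO_PI * (m + 0.5) / n_comp) for m in range(n_comp)]
    variances = [0.5] * n_comp
    weights = [1.0 / n_comp] * n_comp
    for _ in range(n_iter):
        counts = [0.0] * n_comp   # soft counts N_m
        s1 = [0.0] * n_comp       # sum of gamma * residual
        s2 = [0.0] * n_comp       # sum of gamma * residual^2
        for x in phases:
            # E-step: joint posterior over component m and wrap index k.
            terms = []
            for m in range(n_comp):
                if weights[m] == 0.0:
                    continue  # pruned components stay pruned
                norm = weights[m] / math.sqrt(TWO_PI * variances[m])
                for k in range(-k_max, k_max + 1):
                    d = x + TWO_PI * k - means[m]
                    terms.append((m, d, norm * math.exp(-0.5 * d * d / variances[m])))
            z = sum(p for _, _, p in terms)
            for m, d, p in terms:
                g = p / z
                counts[m] += g
                s1[m] += g * d
                s2[m] += g * d * d
        # M-step: shift each mean by its average residual, update variance.
        for m in range(n_comp):
            if counts[m] > 1e-9:
                shift = s1[m] / counts[m]
                means[m] = wrap(means[m] + shift)
                variances[m] = max(s2[m] / counts[m] - shift * shift, var_floor)
        weights = map_weights(counts, phi)
    return weights, means, variances

# Synthetic data: two sources, one of which straddles the +/-pi boundary.
random.seed(0)
phases = [wrap(random.gauss(0.5, 0.2)) for _ in range(200)]
phases += [wrap(random.gauss(3.0, 0.2)) for _ in range(200)]

weights, means, variances = fit(phases)
active = [(round(w, 3), round(mu, 3)) for w, mu in zip(weights, means) if w > 0.0]
print("active components (weight, mean):", active)
```

The number of components with non-zero weight serves as the source-count estimate in this sketch. How strongly the prior prunes redundant Gaussians depends on `phi` and on the amount of data, so the count it reports should be treated as illustrative only.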


Related resources

Blind Source Separation Based on Time-Frequency Sparseness in the Presence of Spatial Aliasing

In this paper, we propose a novel method for blind source separation (BSS) based on time-frequency (TF) sparseness that can estimate the number of sources and time-frequency masks, even if the spatial aliasing problem exists. Many previous approaches, such as the degenerate unmixing estimation technique (DUET) or observation vector clustering (OVC), are limited to microphone arrays of small spatial...

Blind Source Separation with Distributed Microphone Pairs Using Permutation Correction by Intra-Pair TDOA Clustering

In this paper, we present a novel distributed microphone array framework for blind source separation (BSS), in which stereo microphones or closely placed microphone pairs are distributed. Unlike distributing all microphones individually, the time difference of arrival (TDOA) in the paired channels can be robustly estimated without suffering from spatial aliasing. Based on it, sound sources are s...

Time Delay Histogram Based Speech Source Separation Using a Planar Array

Bin-wise time delay is a valuable clue for forming the time-frequency (TF) mask for speech source separation on a two-microphone array. On widely spaced microphones, however, time delay estimation suffers from spatial aliasing. Although a histogram is a simple and effective way to tackle the problem of spatial aliasing, it cannot be directly applied to planar arrays. This paper proposes a his...

Blind Speech Separation Employing Directional Statistics in an Expectation Maximization Framework

In this paper we propose to employ directional statistics in a complex vector space to approach the problem of blind speech separation in the presence of spatially correlated noise. We interpret the values of the short time Fourier transform of the microphone signals to be draws from a mixture of complex Watson distributions, a probabilistic model which naturally accounts for spatial aliasing. ...

Source Separation and Density Estimation by Faithful Equivariant SOM

Jack D. Cowan, Department of Math, University of Chicago, Chicago, IL 60637. We couple the tasks of source separation and density estimation by extracting the local geometrical structure of distributions obtained from mixtures of statistically independent sources. Our modifications of the self-organizing map (SOM) algorithm result in purely digital learning rules which perform...

Publication date: 2009